Hierarchical Infrastructure for Large-Scale Distributed Privacy-Preserving Data Mining
نویسندگان
چکیده
Data Mining is often required to be performed among a number of groups of sites, where the precondition is that no privacy of any site should be leaked out to other sites. In this paper, a hierarchical infrastructure is proposed for large-scale distributed Privacy Preserving Data Mining (PPDM) utilizing a synergy between P2P and Grid. The proposed architecture is characterized with (1) its ability for preserving the privacy in data mining; (2) its ability for decentralized control; (3) its dynamic and scalable ability; (4) its global asynchrony and local communication ability. An algorithm is described to show how to process large-scale distributed PPDM based on the infrastructure. The remarks in the end show the effectiveness and advantages of the proposed infrastructure for large-scale distributed PPDM.
منابع مشابه
Towards Secure Privacy Preserving Data Mining over Computational Grids
Grid computing facilitates the realization of large-scale intraand inter-organization collaborative computer applications by harnessing computing, storage, and networking resources available over the Internet. The concept of grid computing paradigm is analogous to that of electricity power grid where electricity sources are connected together in a grid and consumes’ needs for electricity are ad...
متن کاملPrivacy Preserving Frequency Mining in 2-Part Fully Distributed Setting
Recently, privacy preservation has become one of the key issues in data mining. In many data mining applications, computing frequencies of values or tuples of values in a data set is a fundamental operation repeatedly used. Within the context of privacy preserving data mining, several privacy preserving frequency mining solutions have been proposed. These solutions are crucial steps in many pri...
متن کاملPrivacy Preserving Two-Party Hierarchical Clustering Over Vertically Partitioned Dataset
Data mining has been a popular research area for more than a decade. There are several problems associated with data mining. Among them clustering is one of the most interesting problems. However, this problem becomes more challenging when dataset is distributed between different parties and they do not want to share their data. So, in this paper we propose a privacy preserving two party hierar...
متن کاملPrivacy Preserving Data Mining For Horizontally Distributed Medical Data Analysis
To build reliable prediction models and identify useful patterns, assembling data sets from databases maintained by different sources such as hospitals becomes increasingly common; however, it might divulge sensitive information about individuals and thus leads to increased concerns about privacy, which in turn prevents different parties from sharing information. Privacy Preserving Distributed ...
متن کاملProtocol Design for Privacy-Preserving Data Mining Using Partial Homomorphic Encryption
With the advance of computing power, data mining techniques can extract useful information from large amount of data. In 2012, 2.5 quintillion bytes of data (1 follow 18 zeroes) are created every day. Data privacy is of utmost concern for distributed data mining across multiple parties, which may be competitors. In this thesis, we focus on the privacy preserving techniques in distributed data m...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005